Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 2699720 |
| Missing cells | 59605 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 309.0 MiB |
| Average record size in memory | 120.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 4 |
Alerts
| Alert | Type |
|---|---|
| mission_id has a high cardinality: 263 distinct values | High cardinality |
| geo_country has a high cardinality: 220 distinct values | High cardinality |
| event_timestamp has a high cardinality: 2698313 distinct values | High cardinality |
| mission_difficulty is highly correlated with mission_stars_collected | High correlation |
| mission_stars_collected is highly correlated with mission_difficulty and 3 other fields | High correlation |
| day_auto_increment is highly correlated with days_played_in_month | High correlation |
| lifetime_played_runs is highly correlated with mission_stars_collected and 2 other fields | High correlation |
| max_run_distance is highly correlated with mission_stars_collected and 1 other field | High correlation |
| total_purchases_virtual is highly correlated with virtual_currency_balance | High correlation |
| total_ads_watched is highly correlated with mission_stars_collected and 1 other field | High correlation |
| days_played_in_month is highly correlated with day_auto_increment | High correlation |
| virtual_currency_balance is highly correlated with total_purchases_virtual | High correlation |
| day_auto_increment is highly correlated with total_purchases_virtual and 1 other field | High correlation |
| total_purchases_virtual is highly correlated with day_auto_increment and 1 other field | High correlation |
| days_played_in_month is highly correlated with day_auto_increment and 1 other field | High correlation |
| mission_stars_collected is highly correlated with lifetime_played_runs | High correlation |
| day_auto_increment is highly correlated with days_played_in_month | High correlation |
| lifetime_played_runs is highly correlated with mission_stars_collected | High correlation |
| days_played_in_month is highly correlated with day_auto_increment | High correlation |
| day_auto_increment is highly correlated with total_purchases_virtual and 1 other field | High correlation |
| max_run_distance is highly correlated with days_played_in_month | High correlation |
| total_purchases_virtual is highly correlated with day_auto_increment and 1 other field | High correlation |
| days_played_in_month is highly correlated with day_auto_increment and 2 other fields | High correlation |
| mission_stars_collected is highly skewed (γ1 = 158.1877012) | Skewed |
| day_auto_increment is highly skewed (γ1 = 74.28247207) | Skewed |
| total_purchases_virtual is highly skewed (γ1 = 284.0938017) | Skewed |
| total_purchases_real is highly skewed (γ1 = 67.72103605) | Skewed |
| days_played_in_month is highly skewed (γ1 = 130.1008803) | Skewed |
| virtual_currency_balance is highly skewed (γ1 = 57.32590572) | Skewed |
| event_timestamp is uniformly distributed | Uniform |
| day_auto_increment has 1689025 (62.6%) zeros | Zeros |
| total_purchases_virtual has 1322796 (49.0%) zeros | Zeros |
| total_ads_watched has 1401357 (51.9%) zeros | Zeros |
| total_purchases_real has 2676752 (99.1%) zeros | Zeros |
| days_played_in_month has 1819428 (67.4%) zeros | Zeros |
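The γ1 values in the skew alerts are skewness coefficients. As a rough illustration only, the sketch below computes the moment-based (population) skewness, g1 = m3 / m2^1.5, for two small made-up samples. The function name `skewness` and the sample data are hypothetical, and the profiler may apply a sample-size correction, so this sketch is not expected to reproduce the reported values exactly.

```python
from statistics import fmean


def skewness(values):
    """Population (moment-based) skewness g1 = m3 / m2**1.5."""
    n = len(values)
    mean = fmean(values)
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    if m2 == 0:
        raise ValueError("skewness is undefined for constant data")
    return m3 / m2 ** 1.5


symmetric = [1, 2, 3, 4, 5]           # made-up, perfectly symmetric
long_tail = [0, 0, 0, 1, 1, 2, 1000]  # made-up, one extreme value

print(skewness(symmetric))  # 0.0 for perfectly symmetric data
print(skewness(long_tail))  # positive: the tail extends to the right
```

A single extreme value is enough to push the coefficient well away from zero, which is consistent with the very large γ1 values reported for columns whose maximum is orders of magnitude above the 95th percentile.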
Reproduction
| Analysis started | 2022-05-23 14:23:54.476962 |
|---|---|
| Analysis finished | 2022-05-23 14:33:28.176095 |
| Duration | 9 minutes and 33.7 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
| Distinct | 269972 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 151174.3564 |
| Minimum | 0 |
| Maximum | 290201 |
| Zeros | 10 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13573 |
| Q1 | 86030.75 |
| median | 154024.5 |
| Q3 | 222069.25 |
| 95-th percentile | 276579 |
| Maximum | 290201 |
| Range | 290201 |
| Interquartile range (IQR) | 136038.5 |
Descriptive statistics
| Standard deviation | 82896.61638 |
|---|---|
| Coefficient of variation (CV) | 0.5483510456 |
| Kurtosis | -1.099037267 |
| Mean | 151174.3564 |
| Median Absolute Deviation (MAD) | 68019.5 |
| Skewness | -0.1286024473 |
| Sum | 4.081284334 × 10¹¹ |
| Variance | 6871849007 |
| Monotonicity | Increasing |
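As a sketch of how the figures in a descriptive-statistics table like the one above relate to each other, the snippet below computes the variance, coefficient of variation, range, and median absolute deviation for a small made-up sample using only the standard library. The helper name `describe` and the sample values are hypothetical, and the use of the sample (n - 1) standard deviation is an assumption about the profiler, not something the report states.

```python
from statistics import fmean, median, stdev


def describe(values):
    """Return a few descriptive statistics for a list of numbers."""
    mean = fmean(values)
    std = stdev(values)  # sample standard deviation (n - 1 denominator)
    med = median(values)
    return {
        "mean": mean,
        "std": std,
        "variance": std ** 2,
        "cv": std / mean,  # coefficient of variation; undefined when mean is 0
        "range": max(values) - min(values),
        "mad": median(abs(v - med) for v in values),  # median absolute deviation
    }


sample = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data
stats = describe(sample)
print(stats)
```

The same relationships can be checked against the table: dividing the reported standard deviation by the reported mean gives the reported CV, and squaring the standard deviation gives the variance.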
| Value | Count | Frequency (%) |
| 0 | 10 | < 0.1% |
| 199378 | 10 | < 0.1% |
| 199380 | 10 | < 0.1% |
| 199381 | 10 | < 0.1% |
| 199382 | 10 | < 0.1% |
| 199383 | 10 | < 0.1% |
| 199384 | 10 | < 0.1% |
| 199385 | 10 | < 0.1% |
| 199386 | 10 | < 0.1% |
| 199387 | 10 | < 0.1% |
| Other values (269962) | 2699620 |
| Value | Count | Frequency (%) |
| 0 | 10 | |
| 1 | 10 | |
| 2 | 10 | |
| 3 | 10 | |
| 4 | 10 | |
| 5 | 10 | |
| 6 | 10 | |
| 7 | 10 | |
| 8 | 10 | |
| 9 | 10 |
| Value | Count | Frequency (%) |
| 290201 | 10 | |
| 290200 | 10 | |
| 290199 | 10 | |
| 290198 | 10 | |
| 290197 | 10 | |
| 290196 | 10 | |
| 290195 | 10 | |
| 290194 | 10 | |
| 290193 | 10 | |
| 290192 | 10 |
user_pseudo_id
Real number (ℝ≥0)
| Distinct | 269972 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49976825.34 |
| Minimum | 794 |
| Maximum | 99999617 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 794 |
|---|---|
| 5-th percentile | 5012802 |
| Q1 | 24958010 |
| median | 50039583.5 |
| Q3 | 74913214 |
| 95-th percentile | 94925500 |
| Maximum | 99999617 |
| Range | 99998823 |
| Interquartile range (IQR) | 49955204 |
Descriptive statistics
| Standard deviation | 28863326.06 |
|---|---|
| Coefficient of variation (CV) | 0.5775342044 |
| Kurtosis | -1.2019779 |
| Mean | 49976825.34 |
| Median Absolute Deviation (MAD) | 24977416.5 |
| Skewness | -0.001803132403 |
| Sum | 1.349234349 × 10¹⁴ |
| Variance | 8.330915914 × 10¹⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15725157 | 10 | < 0.1% |
| 34723286 | 10 | < 0.1% |
| 71284638 | 10 | < 0.1% |
| 62129171 | 10 | < 0.1% |
| 18575574 | 10 | < 0.1% |
| 93956116 | 10 | < 0.1% |
| 76307226 | 10 | < 0.1% |
| 70752557 | 10 | < 0.1% |
| 66093111 | 10 | < 0.1% |
| 6430248 | 10 | < 0.1% |
| Other values (269962) | 2699620 |
| Value | Count | Frequency (%) |
| 794 | 10 | |
| 943 | 10 | |
| 1030 | 10 | |
| 2511 | 10 | |
| 2723 | 10 | |
| 2991 | 10 | |
| 3194 | 10 | |
| 3842 | 10 | |
| 5480 | 10 | |
| 6213 | 10 |
| Value | Count | Frequency (%) |
| 99999617 | 10 | |
| 99999432 | 10 | |
| 99999334 | 10 | |
| 99998701 | 10 | |
| 99998367 | 10 | |
| 99998240 | 10 | |
| 99997868 | 10 | |
| 99997620 | 10 | |
| 99997310 | 10 | |
| 99996452 | 10 |
mission_id
Categorical
| Distinct | 263 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 20.6 MiB |
| Mission3 | |
|---|---|
| Mission115 | |
| Mission86 | |
| Mission114 | |
| Mission19 | |
| Other values (258) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.194731378 |
| Min length | 8 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 36 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mission94 |
|---|---|
| 2nd row | Mission11 |
| 3rd row | Mission6 |
| 4th row | Mission3 |
| 5th row | Mission114 |
Common Values
| Value | Count | Frequency (%) |
| Mission3 | 279709 | |
| Mission115 | 279560 | |
| Mission86 | 271341 | |
| Mission114 | 261628 | |
| Mission19 | 261235 | |
| Mission11 | 260368 | |
| Mission109 | 256450 | |
| Mission113 | 256398 | |
| Mission6 | 249904 | |
| Mission12 | 188743 | |
| Other values (253) | 134383 |
[Figure not preserved: Length histogram]
Words
| Value | Count | Frequency (%) |
| mission3 | 279709 | |
| mission115 | 279560 | |
| mission86 | 271341 | |
| mission114 | 261628 | |
| mission19 | 261235 | |
| mission11 | 260368 | |
| mission109 | 256450 | |
| mission113 | 256398 | |
| mission6 | 249904 | |
| mission12 | 188743 | |
| Other values (253) | 134383 |
mission_difficulty
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 20.6 MiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 3.0 | 33 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1946778 | |
| 2.0 | 752908 | 27.9% |
| 3.0 | 33 | < 0.1% |
| (Missing) | 1 | < 0.1% |
[Figures not preserved: Length histogram, Pie chart]
Words
| Value | Count | Frequency (%) |
| 1.0 | 1946778 | |
| 2.0 | 752908 | 27.9% |
| 3.0 | 33 | < 0.1% |
mission_stars_collected
Real number (ℝ≥0)
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.509476357 |
| Minimum | 0 |
| Maximum | 1772 |
| Zeros | 6201 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 7 |
| Q3 | 11 |
| 95-th percentile | 14 |
| Maximum | 1772 |
| Range | 1772 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 5.035307679 |
|---|---|
| Coefficient of variation (CV) | 0.6705271366 |
| Kurtosis | 55000.28863 |
| Mean | 7.509476357 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 158.1877012 |
| Sum | 20273476 |
| Variance | 25.35432342 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 485088 | |
| 6 | 278386 | |
| 8 | 261153 | |
| 4 | 233907 | |
| 7 | 216019 | |
| 5 | 208218 | |
| 11 | 194310 | |
| 14 | 178627 | 6.6% |
| 9 | 177107 | 6.6% |
| 12 | 131572 | 4.9% |
| Other values (68) | 335332 |
| Value | Count | Frequency (%) |
| 0 | 6201 | 0.2% |
| 1 | 16610 | 0.6% |
| 2 | 7466 | 0.3% |
| 3 | 485088 | |
| 4 | 233907 | |
| 5 | 208218 | |
| 6 | 278386 | |
| 7 | 216019 | |
| 8 | 261153 | |
| 9 | 177107 | 6.6% |
| Value | Count | Frequency (%) |
| 1772 | 1 | |
| 1769 | 2 | |
| 1768 | 1 | |
| 1766 | 1 | |
| 1765 | 1 | |
| 1761 | 2 | |
| 1760 | 1 | |
| 1759 | 1 | |
| 280 | 1 | |
| 277 | 1 |
day_auto_increment
Real number (ℝ)
HIGH CORRELATION (×4), SKEWED, ZEROS
| Distinct | 54 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1811.976636 |
| Minimum | -1 |
| Maximum | 10000001 |
| Zeros | 1689025 |
| Zeros (%) | 62.6% |
| Negative | 7328 |
| Negative (%) | 0.3% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 10000001 |
| Range | 10000002 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 134572.5713 |
|---|---|
| Coefficient of variation (CV) | 74.26838106 |
| Kurtosis | 5515.889744 |
| Mean | 1811.976636 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 74.28247207 |
| Sum | 4891811444 |
| Variance | 1.810977694 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1689025 | |
| 1 | 613906 | 22.7% |
| 2 | 204795 | 7.6% |
| 3 | 87440 | 3.2% |
| 4 | 42314 | 1.6% |
| 5 | 22707 | 0.8% |
| 6 | 12144 | 0.4% |
| -1 | 7328 | 0.3% |
| 7 | 7064 | 0.3% |
| 8 | 4230 | 0.2% |
| Other values (44) | 8757 | 0.3% |
| Value | Count | Frequency (%) |
| -1 | 7328 | 0.3% |
| 0 | 1689025 | |
| 1 | 613906 | 22.7% |
| 2 | 204795 | 7.6% |
| 3 | 87440 | 3.2% |
| 4 | 42314 | 1.6% |
| 5 | 22707 | 0.8% |
| 6 | 12144 | 0.4% |
| 7 | 7064 | 0.3% |
| 8 | 4230 | 0.2% |
| Value | Count | Frequency (%) |
| 10000001 | 9 | < 0.1% |
| 10000000 | 53 | < 0.1% |
| 9999999 | 427 | |
| 58 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 50 | 6 | < 0.1% |
| 46 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
lifetime_played_runs
Real number (ℝ≥0)
| Distinct | 285 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7965 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.175839554 |
| Minimum | 0 |
| Maximum | 632 |
| Zeros | 5955 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 19 |
| Maximum | 632 |
| Range | 632 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 7.939005078 |
|---|---|
| Coefficient of variation (CV) | 1.285494062 |
| Kurtosis | 183.4860265 |
| Mean | 6.175839554 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 7.600313068 |
| Sum | 16623847 |
| Variance | 63.02780163 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 544242 | |
| 1 | 418944 | |
| 3 | 329511 | |
| 4 | 238250 | |
| 5 | 191507 | 7.1% |
| 6 | 158984 | 5.9% |
| 7 | 129347 | 4.8% |
| 8 | 105996 | 3.9% |
| 9 | 84545 | 3.1% |
| 10 | 69151 | 2.6% |
| Other values (275) | 421278 |
| Value | Count | Frequency (%) |
| 0 | 5955 | 0.2% |
| 1 | 418944 | |
| 2 | 544242 | |
| 3 | 329511 | |
| 4 | 238250 | |
| 5 | 191507 | 7.1% |
| 6 | 158984 | 5.9% |
| 7 | 129347 | 4.8% |
| 8 | 105996 | 3.9% |
| 9 | 84545 | 3.1% |
| Value | Count | Frequency (%) |
| 632 | 1 | < 0.1% |
| 454 | 10 | |
| 408 | 1 | < 0.1% |
| 406 | 1 | < 0.1% |
| 403 | 1 | < 0.1% |
| 402 | 3 | < 0.1% |
| 389 | 1 | < 0.1% |
| 387 | 2 | < 0.1% |
| 386 | 2 | < 0.1% |
| 385 | 3 | < 0.1% |
max_run_distance
Real number (ℝ≥0)
| Distinct | 9702 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 7965 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2387.397828 |
| Minimum | 0 |
| Maximum | 57880 |
| Zeros | 5981 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1140 |
| Q1 | 1502 |
| median | 2135 |
| Q3 | 2907 |
| 95-th percentile | 4632 |
| Maximum | 57880 |
| Range | 57880 |
| Interquartile range (IQR) | 1405 |
Descriptive statistics
| Standard deviation | 1197.57174 |
|---|---|
| Coefficient of variation (CV) | 0.5016221956 |
| Kurtosis | 24.05959994 |
| Mean | 2387.397828 |
| Median Absolute Deviation (MAD) | 677 |
| Skewness | 2.252180527 |
| Sum | 6426290040 |
| Variance | 1434178.073 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5981 | 0.2% |
| 1224 | 1710 | 0.1% |
| 1145 | 1707 | 0.1% |
| 1167 | 1705 | 0.1% |
| 1160 | 1702 | 0.1% |
| 1311 | 1700 | 0.1% |
| 1201 | 1694 | 0.1% |
| 1181 | 1684 | 0.1% |
| 1234 | 1682 | 0.1% |
| 1312 | 1673 | 0.1% |
| Other values (9692) | 2670517 | |
| (Missing) | 7965 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 5981 | |
| 103 | 1 | < 0.1% |
| 104 | 3 | < 0.1% |
| 108 | 2 | < 0.1% |
| 113 | 2 | < 0.1% |
| 116 | 1 | < 0.1% |
| 117 | 2 | < 0.1% |
| 118 | 3 | < 0.1% |
| 119 | 4 | < 0.1% |
| 122 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 57880 | 9 | |
| 28463 | 1 | < 0.1% |
| 27342 | 9 | |
| 26452 | 10 | |
| 24648 | 10 | |
| 22371 | 4 | < 0.1% |
| 21744 | 1 | < 0.1% |
| 20807 | 4 | < 0.1% |
| 20153 | 2 | < 0.1% |
| 20053 | 3 | < 0.1% |
total_purchases_virtual
Real number (ℝ≥0)
HIGH CORRELATION (×3), SKEWED, ZEROS
| Distinct | 5377 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 8062 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51568.88489 |
| Minimum | 0 |
| Maximum | 1245420000 |
| Zeros | 1322796 |
| Zeros (%) | 49.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 500 |
| Q3 | 4000 |
| 95-th percentile | 11000 |
| Maximum | 1245420000 |
| Range | 1245420000 |
| Interquartile range (IQR) | 4000 |
Descriptive statistics
| Standard deviation | 3304984.35 |
|---|---|
| Coefficient of variation (CV) | 64.08873019 |
| Kurtosis | 96175.47529 |
| Mean | 51568.88489 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 284.0938017 |
| Sum | 1.388058016 × 10¹¹ |
| Variance | 1.092292155 × 10¹³ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1322796 | |
| 1500 | 315472 | 11.7% |
| 4000 | 128332 | 4.8% |
| 5500 | 87348 | 3.2% |
| 3000 | 80625 | 3.0% |
| 500 | 65728 | 2.4% |
| 4500 | 64536 | 2.4% |
| 6000 | 58395 | 2.2% |
| 1000 | 48742 | 1.8% |
| 2000 | 45576 | 1.7% |
| Other values (5367) | 474108 | 17.6% |
| Value | Count | Frequency (%) |
| 0 | 1322796 | |
| 500 | 65728 | 2.4% |
| 1000 | 48742 | 1.8% |
| 1500 | 315472 | 11.7% |
| 2000 | 45576 | 1.7% |
| 2500 | 37774 | 1.4% |
| 3000 | 80625 | 3.0% |
| 3500 | 24003 | 0.9% |
| 4000 | 128332 | 4.8% |
| 4500 | 64536 | 2.4% |
| Value | Count | Frequency (%) |
| 1245420000 | 2 | < 0.1% |
| 1243550000 | 8 | |
| 1093650000 | 3 | < 0.1% |
| 1083550000 | 1 | < 0.1% |
| 480000000 | 9 | |
| 470000000 | 1 | < 0.1% |
| 463547000 | 7 | |
| 434526000 | 2 | < 0.1% |
| 393856000 | 10 | |
| 223548000 | 10 |
total_ads_watched
Real number (ℝ≥0)
| Distinct | 170 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8058 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.432695487 |
| Minimum | 0 |
| Maximum | 239 |
| Zeros | 1401357 |
| Zeros (%) | 51.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 11 |
| Maximum | 239 |
| Range | 239 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 5.001427299 |
|---|---|
| Coefficient of variation (CV) | 2.055919997 |
| Kurtosis | 82.39498735 |
| Mean | 2.432695487 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.929526616 |
| Sum | 6547994 |
| Variance | 25.01427503 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1401357 | |
| 1 | 320961 | 11.9% |
| 2 | 217961 | 8.1% |
| 3 | 161983 | 6.0% |
| 4 | 122860 | 4.6% |
| 5 | 93524 | 3.5% |
| 6 | 71118 | 2.6% |
| 7 | 55087 | 2.0% |
| 8 | 42428 | 1.6% |
| 9 | 33425 | 1.2% |
| Other values (160) | 170958 | 6.3% |
| Value | Count | Frequency (%) |
| 0 | 1401357 | |
| 1 | 320961 | 11.9% |
| 2 | 217961 | 8.1% |
| 3 | 161983 | 6.0% |
| 4 | 122860 | 4.6% |
| 5 | 93524 | 3.5% |
| 6 | 71118 | 2.6% |
| 7 | 55087 | 2.0% |
| 8 | 42428 | 1.6% |
| 9 | 33425 | 1.2% |
| Value | Count | Frequency (%) |
| 239 | 1 | < 0.1% |
| 235 | 1 | < 0.1% |
| 230 | 1 | < 0.1% |
| 225 | 1 | < 0.1% |
| 220 | 1 | < 0.1% |
| 217 | 1 | < 0.1% |
| 205 | 1 | < 0.1% |
| 202 | 1 | < 0.1% |
| 201 | 3 | |
| 198 | 1 | < 0.1% |
total_purchases_real
Real number (ℝ≥0)
| Distinct | 606 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8120 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02275532026 |
| Minimum | 0 |
| Maximum | 133.93 |
| Zeros | 2676752 |
| Zeros (%) | 99.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 133.93 |
| Range | 133.93 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6200074983 |
|---|---|
| Coefficient of variation (CV) | 27.246705 |
| Kurtosis | 7016.985764 |
| Mean | 0.02275532026 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 67.72103605 |
| Sum | 61248.22 |
| Variance | 0.3844092979 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2676752 | |
| 1.99 | 2139 | 0.1% |
| 0.48 | 1158 | < 0.1% |
| 0.49 | 814 | < 0.1% |
| 4.98 | 661 | < 0.1% |
| 0.99 | 598 | < 0.1% |
| 0.37 | 445 | < 0.1% |
| 2.99 | 420 | < 0.1% |
| 3.99 | 356 | < 0.1% |
| 0.11 | 261 | < 0.1% |
| Other values (596) | 7996 | 0.3% |
| (Missing) | 8120 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 2676752 | |
| 0.11 | 261 | < 0.1% |
| 0.13 | 5 | < 0.1% |
| 0.14 | 25 | < 0.1% |
| 0.22 | 71 | < 0.1% |
| 0.27 | 2 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| 0.31 | 48 | < 0.1% |
| 0.32 | 29 | < 0.1% |
| 0.33 | 155 | < 0.1% |
| Value | Count | Frequency (%) |
| 133.93 | 1 | < 0.1% |
| 96.69 | 4 | < 0.1% |
| 94.7 | 1 | < 0.1% |
| 92.53 | 3 | < 0.1% |
| 79.91 | 6 | |
| 78.76 | 4 | < 0.1% |
| 73.94 | 3 | < 0.1% |
| 73.93 | 10 | |
| 67.83 | 10 | |
| 64.97 | 6 |
geo_country
Categorical
| Distinct | 220 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3482 |
| Missing (%) | 0.1% |
| Memory size | 20.6 MiB |
| United States | |
|---|---|
| Mexico | |
| Brazil | 132349 |
| France | 112590 |
| Russia | 108295 |
| Other values (215) |
Length
| Max length | 24 |
|---|---|
| Median length | 7 |
| Mean length | 8.137137374 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Honduras |
|---|---|
| 2nd row | Honduras |
| 3rd row | Honduras |
| 4th row | Honduras |
| 5th row | Honduras |
Common Values
| Value | Count | Frequency (%) |
| United States | 450568 | 16.7% |
| Mexico | 342940 | 12.7% |
| Brazil | 132349 | 4.9% |
| France | 112590 | 4.2% |
| Russia | 108295 | 4.0% |
| India | 96678 | 3.6% |
| Germany | 91358 | 3.4% |
| Turkey | 83295 | 3.1% |
| United Kingdom | 83167 | 3.1% |
| Colombia | 68467 | 2.5% |
| Other values (210) | 1126531 |
[Figure not preserved: Length histogram]
Words
| Value | Count | Frequency (%) |
| united | 540059 | |
| states | 450568 | 13.5% |
| mexico | 342940 | 10.3% |
| brazil | 132349 | 4.0% |
| france | 112590 | 3.4% |
| russia | 108295 | 3.2% |
| india | 96678 | 2.9% |
| germany | 91358 | 2.7% |
| turkey | 83295 | 2.5% |
| kingdom | 83167 | 2.5% |
| Other values (249) | 1292045 |
days_played_in_month
Real number (ℝ)
HIGH CORRELATION (×4), SKEWED, ZEROS
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7974 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 591.1845683 |
| Minimum | -1 |
| Maximum | 9999999 |
| Zeros | 1819428 |
| Zeros (%) | 67.4% |
| Negative | 7657 |
| Negative (%) | 0.3% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 9999999 |
| Range | 10000000 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 76854.39583 |
|---|---|
| Coefficient of variation (CV) | 130.0006799 |
| Kurtosis | 16924.25162 |
| Mean | 591.1845683 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 130.1008803 |
| Sum | 1591318697 |
| Variance | 5906598158 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1819428 | |
| 1 | 593513 | 22.0% |
| 2 | 166533 | 6.2% |
| 3 | 59488 | 2.2% |
| 4 | 24227 | 0.9% |
| 5 | 10926 | 0.4% |
| -1 | 7657 | 0.3% |
| 6 | 4881 | 0.2% |
| 7 | 2318 | 0.1% |
| 8 | 1187 | < 0.1% |
| Other values (18) | 1588 | 0.1% |
| (Missing) | 7974 | 0.3% |
| Value | Count | Frequency (%) |
| -1 | 7657 | 0.3% |
| 0 | 1819428 | |
| 1 | 593513 | 22.0% |
| 2 | 166533 | 6.2% |
| 3 | 59488 | 2.2% |
| 4 | 24227 | 0.9% |
| 5 | 10926 | 0.4% |
| 6 | 4881 | 0.2% |
| 7 | 2318 | 0.1% |
| 8 | 1187 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999999 | 159 | |
| 25 | 1 | < 0.1% |
| 24 | 3 | < 0.1% |
| 23 | 3 | < 0.1% |
| 22 | 2 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 5 | < 0.1% |
| 19 | 3 | < 0.1% |
| 18 | 7 | < 0.1% |
| 17 | 13 | < 0.1% |
virtual_currency_balance
Real number (ℝ≥0)
| Distinct | 46931 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 7965 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 497783.2425 |
| Minimum | 0 |
| Maximum | 2146000000 |
| Zeros | 216 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 603 |
| Q1 | 4014 |
| median | 5648 |
| Q3 | 6992 |
| 95-th percentile | 11459 |
| Maximum | 2146000000 |
| Range | 2146000000 |
| Interquartile range (IQR) | 2978 |
Descriptive statistics
| Standard deviation | 21832254.87 |
|---|---|
| Coefficient of variation (CV) | 43.85895909 |
| Kurtosis | 3601.872758 |
| Mean | 497783.2425 |
| Median Absolute Deviation (MAD) | 1526 |
| Skewness | 57.32590572 |
| Sum | 1.339910532 × 10¹² |
| Variance | 4.766473528 × 10¹⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5579 | 1862 | 0.1% |
| 5570 | 1841 | 0.1% |
| 5554 | 1834 | 0.1% |
| 5545 | 1801 | 0.1% |
| 5571 | 1793 | 0.1% |
| 5574 | 1793 | 0.1% |
| 5559 | 1769 | 0.1% |
| 5562 | 1769 | 0.1% |
| 5566 | 1766 | 0.1% |
| 5548 | 1759 | 0.1% |
| Other values (46921) | 2673768 | |
| (Missing) | 7965 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 216 | |
| 1 | 166 | |
| 2 | 183 | |
| 3 | 174 | |
| 4 | 177 | |
| 5 | 182 | |
| 6 | 159 | |
| 7 | 181 | |
| 8 | 165 | |
| 9 | 155 |
| Value | Count | Frequency (%) |
| 2146000000 | 9 | |
| 2000000000 | 4 | < 0.1% |
| 1999990000 | 1 | < 0.1% |
| 1997650000 | 2 | < 0.1% |
| 1996150000 | 3 | < 0.1% |
| 1996140000 | 5 | |
| 1995620000 | 10 | |
| 1995250000 | 10 | |
| 1994110000 | 5 | |
| 1994100000 | 8 |
event_timestamp
Categorical
| Distinct | 2698313 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 20.6 MiB |
| 2022-02-13 17:40:39.244 UTC | 3 |
|---|---|
| 2022-02-26 19:17:49.008 UTC | 2 |
| 2022-02-04 23:57:35.289 UTC | 2 |
| 2022-02-17 17:52:47.03 UTC | 2 |
| 2022-02-25 12:51:13.228 UTC | 2 |
| Other values (2698308) |
Length
| Max length | 30 |
|---|---|
| Median length | 27 |
| Mean length | 26.8916328 |
| Min length | 23 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 2696908 |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | 2022-03-02 15:46:04.025 UTC |
|---|---|
| 2nd row | 2022-03-14 02:00:06.716 UTC |
| 3rd row | 2022-03-14 02:01:13.236 UTC |
| 4th row | 2022-03-05 17:56:06.473 UTC |
| 5th row | 2022-03-05 17:52:48.383 UTC |
Common Values
| Value | Count | Frequency (%) |
| 2022-02-13 17:40:39.244 UTC | 3 | < 0.1% |
| 2022-02-26 19:17:49.008 UTC | 2 | < 0.1% |
| 2022-02-04 23:57:35.289 UTC | 2 | < 0.1% |
| 2022-02-17 17:52:47.03 UTC | 2 | < 0.1% |
| 2022-02-25 12:51:13.228 UTC | 2 | < 0.1% |
| 2022-02-19 18:47:18.364 UTC | 2 | < 0.1% |
| 2022-02-13 10:57:14.24 UTC | 2 | < 0.1% |
| 2022-02-22 21:27:40.712 UTC | 2 | < 0.1% |
| 2022-02-02 15:15:35.008 UTC | 2 | < 0.1% |
| 2022-02-23 10:16:06.155 UTC | 2 | < 0.1% |
| Other values (2698303) | 2699698 |
[Figure not preserved: Length histogram]
Words
| Value | Count | Frequency (%) |
| utc | 2699719 | |
| 2022-02-27 | 103257 | 1.3% |
| 2022-02-26 | 101557 | 1.3% |
| 2022-02-20 | 101513 | 1.3% |
| 2022-02-25 | 97943 | 1.2% |
| 2022-02-19 | 97462 | 1.2% |
| 2022-02-22 | 96591 | 1.2% |
| 2022-02-13 | 95432 | 1.2% |
| 2022-02-21 | 95224 | 1.2% |
| 2022-02-06 | 94783 | 1.2% |
| Other values (2655782) | 4515676 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, so it captures nonlinear monotonic relationships better than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.
To calculate ρ for two variables X and Y, divide the covariance of the rank variables of X and Y by the product of their standard deviations.
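The calculation described above can be sketched in plain Python: rank both variables, then apply the covariance-over-standard-deviations formula to the ranks. The helper names `average_ranks`, `pearson`, and `spearman` and the sample data are hypothetical, and giving tied values the average of their ranks is an assumption about tie handling, not something the report specifies.

```python
from math import sqrt


def average_ranks(values):
    """Assign 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of tied values starting at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Covariance divided by the product of standard deviations.

    The 1/n factors in the covariance and the standard deviations cancel,
    so plain sums are used.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]  # y = x**3: nonlinear but strictly increasing

print(spearman(x, y))  # 1.0 up to floating-point rounding
print(pearson(x, y))   # below 1, because the relationship is not linear
```

Because only the ordering of values matters, extreme values such as the maxima seen in several columns of this dataset influence ρ far less than they influence r.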
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r; only the sign of the slope does.
To calculate r for two variables X and Y, divide the covariance of X and Y by the product of their standard deviations.
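A minimal sketch of this formula, together with a check of the location-and-scale invariance mentioned above, might look as follows. The function name `pearson` and all sample values are made up for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson's r: covariance over the product of standard deviations.

    The 1/n factors in numerator and denominator cancel, so sums are used.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]  # made-up, roughly linear in x
r = pearson(x, y)

# Shift and rescale each variable separately (positive scale factors).
x_scaled = [10 * a + 3 for a in x]
y_scaled = [0.5 * b - 7 for b in y]
r_scaled = pearson(x_scaled, y_scaled)

# A negative scale factor flips the sign of r but keeps its magnitude.
r_flipped = pearson(x, [-b for b in y])

print(r, r_scaled, r_flipped)
```

The first two printed values agree up to floating-point rounding, and the third is the negation of the first.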
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.
To calculate τ for two variables X and Y, count the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
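The pair-counting definition above can be written directly as a short sketch. The function name `kendall_tau` and the data are hypothetical; this is the simple variant without a tie correction, and library implementations may use a tie-corrected variant and a faster algorithm, since comparing every pair is quadratic in the number of rows.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Kendall's tau without tie correction: (concordant - discordant) / pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        product = (x[i] - x[j]) * (y[i] - y[j])
        if product > 0:
            concordant += 1  # both variables move in the same direction
        elif product < 0:
            discordant += 1  # the variables move in opposite directions
        # product == 0 means a tie in x or y; it counts toward neither.
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]  # made-up data: 7 concordant and 3 discordant pairs

print(kendall_tau(x, y))  # 0.4
```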
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal, and interval variables, captures nonlinear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | df_index | user_pseudo_id | mission_id | mission_difficulty | mission_stars_collected | day_auto_increment | lifetime_played_runs | max_run_distance | total_purchases_virtual | total_ads_watched | total_purchases_real | geo_country | days_played_in_month | virtual_currency_balance | event_timestamp |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 15725157 | Mission94 | 2.0 | 3.0 | 0.0 | 2.0 | 1763.0 | 2000.0 | 0.0 | 0.0 | Honduras | 0.0 | 5000.0 | 2022-03-02 15:46:04.025 UTC |
| 1 | 0 | 15725157 | Mission11 | 1.0 | 8.0 | 3.0 | 19.0 | 2585.0 | 7000.0 | 0.0 | 0.0 | Honduras | 3.0 | 2446.0 | 2022-03-14 02:00:06.716 UTC |
| 2 | 0 | 15725157 | Mission6 | 1.0 | 11.0 | 3.0 | 20.0 | 2585.0 | 7000.0 | 0.0 | 0.0 | Honduras | 3.0 | 2752.0 | 2022-03-14 02:01:13.236 UTC |
| 3 | 0 | 15725157 | Mission3 | 1.0 | 7.0 | 2.0 | 12.0 | 2266.0 | 7000.0 | 0.0 | 0.0 | Honduras | 2.0 | 1306.0 | 2022-03-05 17:56:06.473 UTC |
| 4 | 0 | 15725157 | Mission114 | 1.0 | 6.0 | 2.0 | 9.0 | 2266.0 | 3500.0 | 0.0 | 0.0 | Honduras | 2.0 | 4284.0 | 2022-03-05 17:52:48.383 UTC |
| 5 | 0 | 15725157 | Mission86 | 2.0 | 12.0 | 3.0 | 21.0 | 2585.0 | 7000.0 | 0.0 | 0.0 | Honduras | 3.0 | 3322.0 | 2022-03-14 02:03:20.305 UTC |
| 6 | 0 | 15725157 | Mission115 | 1.0 | 5.0 | 0.0 | 8.0 | 2266.0 | 3500.0 | 0.0 | 0.0 | Honduras | 0.0 | 3561.0 | 2022-03-02 15:54:49.526 UTC |
| 7 | 0 | 15725157 | Mission12 | 2.0 | 14.0 | 3.0 | 23.0 | 2585.0 | 8500.0 | 0.0 | 0.0 | Honduras | 3.0 | 2820.0 | 2022-03-14 02:05:01.562 UTC |
| 8 | 0 | 15725157 | Mission109 | 1.0 | 4.0 | 0.0 | 3.0 | 2266.0 | 2000.0 | 0.0 | 0.0 | Honduras | 0.0 | 4238.0 | 2022-03-02 15:46:51.889 UTC |
| 9 | 0 | 15725157 | Mission19 | 2.0 | 8.0 | 3.0 | 19.0 | 2585.0 | 7000.0 | 0.0 | 0.0 | Honduras | 3.0 | 2446.0 | 2022-03-14 02:00:19.959 UTC |
Last rows
| | df_index | user_pseudo_id | mission_id | mission_difficulty | mission_stars_collected | day_auto_increment | lifetime_played_runs | max_run_distance | total_purchases_virtual | total_ads_watched | total_purchases_real | geo_country | days_played_in_month | virtual_currency_balance | event_timestamp |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2699710 | 290201 | 57543559 | Mission3 | 1.0 | 6.0 | 1.0 | 4.0 | 1380.0 | 5500.0 | 2.0 | 0.0 | United Kingdom | 1.0 | 1587.0 | 2022-02-12 19:07:47.398 UTC |
| 2699711 | 290201 | 57543559 | Mission19 | 2.0 | 9.0 | 1.0 | 12.0 | 1740.0 | 5500.0 | 12.0 | 0.0 | United Kingdom | 1.0 | 10786.0 | 2022-02-12 19:27:31.618 UTC |
| 2699712 | 290201 | 57543559 | Mission11 | 1.0 | 9.0 | 1.0 | 12.0 | 1740.0 | 5500.0 | 11.0 | 0.0 | United Kingdom | 1.0 | 10786.0 | 2022-02-12 19:25:58.911 UTC |
| 2699713 | 290201 | 57543559 | Mission6 | 1.0 | 8.0 | 1.0 | 8.0 | 1740.0 | 5500.0 | 6.0 | 0.0 | United Kingdom | 1.0 | 3017.0 | 2022-02-12 19:15:31.415 UTC |
| 2699714 | 290201 | 57543559 | Mission114 | 1.0 | 6.0 | 1.0 | 4.0 | 1380.0 | 5500.0 | 2.0 | 0.0 | United Kingdom | 1.0 | 1587.0 | 2022-02-12 19:06:39.925 UTC |
| 2699715 | 290201 | 57543559 | Mission113 | 1.0 | 5.0 | 1.0 | 3.0 | 1380.0 | 5500.0 | 1.0 | 0.0 | United Kingdom | 1.0 | 645.0 | 2022-02-12 19:04:37.206 UTC |
| 2699716 | 290201 | 57543559 | Mission109 | 1.0 | 4.0 | 1.0 | 2.0 | 1380.0 | 0.0 | 0.0 | 0.0 | United Kingdom | 1.0 | 5667.0 | 2022-02-12 19:01:39.216 UTC |
| 2699717 | 290201 | 57543559 | Mission115 | 1.0 | 3.0 | 0.0 | 1.0 | 1380.0 | 0.0 | 0.0 | 0.0 | United Kingdom | 0.0 | 5588.0 | 2022-02-11 20:29:04.483 UTC |
| 2699718 | 290201 | 57543559 | Mission86 | 2.0 | 12.0 | 1.0 | 14.0 | 2801.0 | 5500.0 | 14.0 | 0.0 | United Kingdom | 1.0 | 12680.0 | 2022-02-12 19:38:52.784 UTC |
| 2699719 | 290201 | 57543559 | Mission10 | 1.0 | 14.0 | 2.0 | 21.0 | 4662.0 | 21000.0 | 22.0 | 0.0 | United Kingdom | 2.0 | 2628.0 | 2022-02-13 14:49:04.674 UTC |